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1 Introduction 

Our purpose is to consider the problem of hidden information; that is, a game 
between two economic actors, one of whom possesses mutually relevant informa- 
tion that the other does not. This is a common situation: The classic example 
being the "game" between a monopolist, who doesn't know the consumer's will- 
ingness to pay, and the consumer, who obviously does. Within the realm of 
contract theory, relevant situations include a seller who is better informed than 
a buyer about the cost of producing a specific good; an employee who alone 
knows the difficulty of completing a task for his employer; a divisional manager 
who can conceal information about his division's investment opportunities from 
headquarters; and a leader with better information than her followers about the 
value of pursuing a given course of action. In each of these situations, having 
private information gives the player possessing it a potential strategic advantage 
in his dealings with the other player. For example, consider a seller who has 
better information about his costs than his buyer. By behaving as if he had 
high costs, the seller can seek to induce the buyer to pay him more than she 
would if she knew he had low costs. That is, he has an incentive to use his 
superior knowledge to capture an "information rent." Of course, the buyer is information rent 
aware of this possibility; so, if she has the right to propose the contract between 
them, she will propose a contract that works to reduce this information rent. 
Indeed, how the contract proposer — the principal — designs contracts to mitigate 
the informational disadvantage she faces will be a major focus of this reading. 

Not surprisingly, given the many applications of the screening model, our 
coverage of it cannot hope to be fully original. 1 Indeed, while there are idiosyn- 
cratic aspects to our approach, our treatment is quite standard. 

2 The Basics of Contractual Screening 

Let us begin by broadly describing the situation in which we are interested. We 
shall fill in the blanks as we proceed through this reading. 

• Two players are involved in a strategic relationship; that is, each player's 
well being depends on the play of the other player. 

• One player is better informed (or will become better informed) than the 

other; that is, he has private information about some state of nature private information 
relevant to the relationship. As is typical in information economics, we 

refer to the player with the private information as the informed player informed player 

and the player without the private information as the uninformed player, uninformed player 

• Critical to our analysis of these situations is the bargaining game that 
determines the contract. We will refer to the contract proposer as the 



lr rhe books by Laffont and Tirolc (1993), Salanie (1997), and Macho-Stadler and Perez- 
Castrillo (1997) include similar chapters, although the emphasis varies widely among them. 
Surveys have also appeared in journals (see, e.g., Caillaud et al., 1988). 
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principal and the player who receives the proposal as the agent. More- 
over, we assume contracts are proposed on a take-it-or-leave-it basis: The 
agent's only choices are to accept or reject the contract proposed by the 
principal. Rejection ends the relationship between the players. A key 
assumption is that the principal is the uninformed player. Models like 
this, in which the uninformed player proposes the contract, are referred 
to as screening models. In contrast, were the informed player the contract 
proposer, we would have a type of signaling model. 

• A contract can be seen as setting the rules of a secondary game to be 
played by the principal and the agent. 

We presume that the asymmetry of information that exists in this game 
results because prior experience or expertise, location, or other factors give the 
agent free access to information about the state of nature; while the absence of 
expertise, different experience or location, or other factors exclude the principal 
from this information (make it prohibitively expensive for her to acquire it). 
For example, past jobs may tell a seller how efficient he is — and thus what 
his costs will be — while ignorance of these past jobs means the buyer has a 
less precise estimate of what his costs will be. We assume that the reason for 
this asymmetry of information is exogenous. In particular, the informed player 
is simply assumed to be endowed with his information for the purpose of the 
situation we wish to model. Here, we assume that only one player is better 
informed; that is, we are ruling out situations where each player has his or her 
own private information. 2 

Given this information structure, the two parties interact according to some 
specified rules that constitute the extensive form of a game. In this two-person 
game, the players must contract with each other to achieve some desired out- 
come. In particular, there is no ability to rely on some exogenously fixed and 
anonymous market mechanism. Our focus will be on instances of the game 
where the informed player can potentially benefit from his informational advan- 
tage (e.g., perhaps inducing a buyer to pay more for a good than necessary be- 
cause she fears the seller is high cost). But, because the informed player doesn't 
have the first move — the uninformed player gets to propose the contract — this 
informational advantage is not absolute: Through her design of the contract, the 
uninformed player will seek to offset the informed player's inherent advantage. 

3 The Two-Type Screening Model 

We will begin to formalize these ideas in as simple a model as possible, namely 
the two-type model. In the two-type model, the state of nature can take one of two-type model 
two possible values. As is common in this literature, we will refer to the realized 
state of nature as the agent's type. Given that there are only two possible state, type 



2 Put formally, the uninformed player's information partition is coarser than the informed 
player's information partition. 
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the agent can have one of just two types. 

Before proceeding, however, we need to emphasize that such simplicity in 
modeling is not without cost. The two-type model is "treacherous," in so far 
as it may suggest conclusions that seem general, but that are not. For exam- 
ple, the conclusion that we will shortly reach with this model that the optimal 
contract implies distinct outcomes for distinct states of nature — a result called 
separation — is not as general as it may seem. Moreover, the assumption of two separation 
types conceals, in essence, a variety of assumptions that must be made clear. 
It similarly conceals the richness of the screening problem in complex, more 
realistic, relationships. Our view is that few economic prescriptions and predic- 
tions should be reached from considering just the two-type model. Keeping this 
admonition in mind, we now turn to a simple analysis of private procurement 
in a two-type model. 

3.1 A simple two-type screening situation 

A large retailer (the principal) wishes to purchase units of some good for resale. 
Assume its size gives it all the bargaining power in its negotiations with the one 
firm capable of supplying this product (the agent) . Let x € R+ denote the units 
of this good and let r (x) denote the retailer's revenues from x units. 3 Assume 
that r (■) is strictly concave and differentiable everywhere. Assume, too, that 
r' (0) > 0. (Because r (•) is a revenue function, r (0) = 0.) 

The retailer is uncertain about the efficiency of the supplier. In particular, 
the retailer knows an inefficient supplier has production costs of Cj (x) , but 
an efficient supplier has production costs of Ce{x). Let the retailer's prior 
belief be that the supplier is inefficient with probability /, where — reflecting its 
uninformed status — < / < 1. The supplier, in contrast, knows its type; that 
is, whether it is efficient or not. 

Assume C\ (•) is increasing, everywhere differentiable, and convex for both 
types, t. (Because C t (•) is a cost function, we know C t (0) = 0.) Consistent with 
the ideas that the two types correspond to different levels of efficiency, we assume 
C'j (x) > C' E (x) for all x > — the inefficient type's marginal-cost schedule lies 
above the efficient type's. Observe, necessarily then, that Ci (x) > Ce (x) for 
all x > 0. 

The retailer and the supplier have to agree on the quantity, x, of the good 
to trade and on a payment, s, for this total quantity. Please note that s is 
not the per-unit price, but the payment for all x units. Profits for retailer and 
supplier are, then, r(x) — s and s — C t (x) respectively. The retailer makes a 
take-it-or-leave-it offer, which the supplier must either accept or refuse. If the 
supplier refuses the offer, there is no trade and each firm's payoff from this 
"transaction" is zero. This outcome, no agreement, is equivalent to agreeing to 



3 For convenience, we'll assume that the retailer incurs no costs other than those associated 
with acquiring the x units from the supplier. Alternatively, we could simply imagine that 
these other costs have been subtracted from revenues, so that r (rr) is profit gross of the cost 
of purchasing the x units. 
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trade units for a payment. Hence, it is without loss of generality to assume 
that the parties must reach some agreement in equilibrium. 

We begin our analysis with the benchmark case of symmetric or full infor- 
mation. That is, for the moment, we'll assume the retailer knows the supplier's 
type (i.e., f = or / = 1). We may immediately characterize the Pareto 
optimal allocation: xf units are traded, where 

xf = argmax {r(x) — C t (x)} . 

x>0 

Since the problem is otherwise uninteresting if trade is never desirable, let's 

assume that r'(0) > C' E (0) so that, with an efficient supplier at least, some 

trade is desirable. Pareto optimality, also referred to as ex post efficiency , then ex post efficiency 

reduces to 

r'(xf ) = C' E {x F E ) and [/ (xf ) - C\ (xf)] xf = 

(where we take the larger non-negative root of the second equation) . Because 
it is optimal to produce a greater amount when marginal costs arc lower, our 
assumptions about C' t (•) imply < xf < xf. In making its contract offer, the 
retailer sets x = xf and it offers a payment, sf , no larger than necessary to 
induce the supplier to accept; that is, sf satisfies 

sf - C t (xf) = 0. 

3.2 Contracts under incomplete information 

This symmetric information solution collapses when the retailer is uninformed 
about the state of nature. To see why, suppose that the retailer offered the 
supplier its choice of (xf, sf) or (xf , sf ) with the expectation that the supplier 
would choose the one appropriate to its type (i.e., the first contract if it were 
efficient and the second if it were not). Observe that the retailer is relying on 
the supplier to honestly disclose its type. Suppose, moreover, that the true state 
of nature is E. By truthfully revealing that the state of nature is E, the supplier 
would just be compensated for its cost of supplying xf units; that is, it would 
earn a profit of sf — Cs(xf) — 0. On the other hand, if the supplier pretends to 
have high costs — claims the state of nature is / — it receives compensation sf , 
while incurring cost C E (xf ) for supplying xf units. This yields the supplier a 
profit of 

sf-C E (xf) = 
C,(zf) - C £ (zf) > 

(recall sf — Cj (xf )). Clearly, then, the efficient-type supplier cannot be relied 
on to honestly disclose its type. 

This difference or profit, Ci(xf) — Cs(xf), which motivates the supplier to 
lie, is called an information rent. This is a loss to the retailer but a gain to information rent 
the supplier. There is, however, an additional loss suffered by the retailer that 
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is not recaptured by the supplier: Lying means inefficiently little is produced; 
that is, a real deadweight loss of 

[r (4) - C E (x F E )] - [r (xf) - C E (xf )] 

is suffered. 

Given this analysis, it would be surprising if the retailer would be so nai've as 
to rely on the supplier to freely reveal its type. In particular, we would expect 
the retailer to seek a means of improving on this ex post inefficient outcome by 
devising a more sophisticated contract. What kind of contracts can be offered? 
The retailer does not know the supplier's level of efficiency, so it may want to 
delegate the choice of quantity to the supplier under a payment schedule that payment schedule 
implicitly rewards the supplier for not pretending its costs are high when they 
are truly low. This payment schedule, S(-), specifies what payment, s = S(x), 
is to be paid the supplier as a function of the units, x, it chooses to supply. 
Wilson (1993) provides evidence that such payment schedules are common in 
real-world contracting. 

If the supplier accepts such a contract, the supplier's choice of quantity, x t , 
is given by 

x t e arg max {S(x) — C t (x)} . (1) 

x>0 

Assume for the moment that this program has a unique solution. Let u t denote 
the value of this maximization program and let St = S(x t ) be the supplier's 
payment under the terms of the contract. By definition, 

ut = s t - C t (x t ). 

Observe that this means we can write the equilibrium payment, s t , as 

s t = u t + C t (x t ) . 

We also define 

R(-) = C I (-)-C E (-) 

as the information-rent function. Our earlier assumptions imply that R(-) is rent function 
positive for x > 0, zero for x = 0, and strictly increasing. 

Revealed preference in the choice of x necessarily implies the following about 
xi and xe- 

u e = s E - C e (x e ) > si - C E (xi) = ui + R(xi) (2) 
m = si - Ci(xi) > s E - Ci(x E ) = u E - R(x E )- (3) 

These inequalities are referred by many names in the literature: incentive- 
compatibility constraints, self- select ion constraints, revelation constraints, and 
truth-telling constraints. Regardless of name, they simply capture the require- 
ment that (xi, si) and (xe, se) be the preferred choices for the supplier in states 
I and E, respectively. 



incentive constraint 
self-selection 
revelation 
truth-telling 
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What can we conclude from expressions (2) and (3)? First, rewriting them 



as 



it follows that 



R{xi) <u E -ui <R{x E ), (4) 

xi < x E , (5) 

because R(-) is strictly increasing. Observe, too, that expression (2) implies 
ue > Ui (except if xj = 0, in which case we only know ue > uj). Finally, 
expressions (2) and (5) implies se > sj (unless xe — xj, in which case (2) and 
(3) imply s E = Sj). 

Of course the contract — payment schedule S (•) — must be acceptable to the 
supplier, which means 

ul > 0; and (6) 

u H > 0. (7) 

If these did not both hold, then the contract would be rejected by one or the 

other or both types of supplier. The constraints (6) and (7) are referred to 

as the agent's participation or individual-rationality constraints. They simply participation 

state that, without any bargaining power, the supplier accepts a contract if and individually rational 

only if accepting does not entail suffering a loss. 

The retailer's problem is to determine a price schedule S(-) that maximizes 
its expected profit ("expected" because, recall, it knows only the probability 
that a give type will be realized) . Specifically, the retailer seeks to maximize 

/ x [r(x 7 ) - S/] + (1 - /) x [r(x E ) - s E ] ; or, equivalcntly, 

/ x [r(xi) - Cj (a;/) - it/] + (1 - /) x [r(x E ) - C E (x E ) - u E ] , 

where (x t ,u t ) are determined by the supplier's optimization program (1) in 
response to S(-). 

Observe that only two points on the whole price schedule enter the retailer's 
objective function: (xj,sj) and (xe,Se)', or, equivalcntly, (xj,uj) and (xe,ue)- 
The maximization of the principal's objectives can be performed with respect 
to just these two points provided that we can recover a general payment sched- 
ule afterwards such that the supplier would accept this schedule and choose 
the appropriate point for its type given this schedule. For this to be possible, 
we know that the self-selection constraints, (2) and (3), plus the participation 
constraints, (6) and (7), must hold. 

In fact, the self- selection constraints and the participation constraints on 
(x],si) and (xe,se) are necessary and sufficient for there to exist a payment 
schedule such that the solution to (1) for type t is (xt, s t ). To prove this asser- 
tion, let (xi, si) and (xe, se) satisfy those constraints and construct the rest of 
the payment schedule as follows: 

S(x) =0 if < x < Xl 
= si if xi < x < xe 
= se if xe < x, 
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when < xj < xe- 4 Given that C*(-) is increasing in x, no supplier would 
ever choose an x other than 0, xj, or xe (the supplier's marginal revenue is 
zero except at these three points). The participation constraints ensure that 
(xt,St) is (weakly) preferable to (0,0) and the self-selection constraints ensure 
that a type-t supplier prefers (x t ,s t ) to (x t >,s t >), t ^ t'. That is, we've shown 
that faced with this schedule, the type-/ supplier's solution to (1) is (xj, sj) — as 
required — and that the type-i? supplier's solution to (1) is (xe, se) — as required. 
The retailer's problem can thus be stated as 

max / x [r(xi) - Cj (xj) - uj] + (1 - /) x [r{x E ) - C E (x E ) - u E \ (8) 

{xi,xe,ui ,U E } 

subject to (2), (3), (6), and (7). Solving this problem using the standard La- 
grangcan method is straightforward, albeit tedious. Because, however, such 
a mechanical method provides little intuition, we pursue a different, though 
equivalent, line of reasoning. 

• One can check that ignoring the self-selection constraints (treating them 
as not binding) leads us back to the symmetric-information arrangement; 
and we know that at least one self-selection constraint is then violated. 
We can, thus, conclude that in our solution to (8) at least one of the 
self-selection constraints is binding. 

• The self- selection constraint in state E implies that: ue > R(xj) + uj > 
uj. Therefore, if the supplier accepts the contract in state /, it will also 
accept it in state E. We can, thus, conclude that constraint (7) is slack 
and can be ignored. 

• It is, however, the case that (6) must be binding at the optimum: Suppose 
not, then we could lower both utility terms ul and uh by some e > 
without violating the participation constraints. Moreover, since the two 
utilities have been changed by the same amount, this can't affect the 
self-selection constraints. But, from (8), lowering the utilities raises the 
principal's profits — which means our "optimum" wasn't optimal. 

• Using the fact that (6) is binding, expression (4) — the pair of self-selection 
constraints — reduces to 

R(xi) < ue < R(x E ). 

Given a pair of quality levels (xi,xe), the retailer wants to keep the 
supplier's rent as low as possible and will, therefore, choose to pay him 
the smallest possible information rent; that is, we can conclude that 
ue — R(xj). The self-selection constraint (2) is, thus, slack, provided 
the necessary monotonicity condition (5) holds. 



4 If xj = 0, then sj = 0. If xj = xg, then si = Se- 
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Plugging our findings, uj = and ue = R(xi), into the retailer's objectives 
yields the following reduced program: 



max {/ x [r(xi) - Cj(sj)] + (1 - /) x \r{x E ) - C E (x E ) - R(xi)]} . 

{{xi,Xe)\ XI<Xe} 

The solution is 

x E = x^ = argmax{r(a;) - C E {x)} (9) 

x>0 

XI = x* I (f) = &Ygma X \r(x)^C I (x)-^—^R(x)\. (10) 

x>0 I f J 

The only step left is to verify that the monotonicity condition (5) is satisfied for 
these values. If we consider the last two terms in the maximand of (10) to be 
cost, we see that the effective marginal cost of output from the inefficient type 
is 

C'j (x) + lj y~R' (x) > C'j (x) > C' E (x) 

for x > 0. 5 The greater the marginal-cost schedule given a fixed marginal- 
revenue schedule, the less is traded; that is, it must be that x}(f) < xf — the 
monoticity condition (5) is satisfied. 

It is worth summarizing the nature and properties of the optimal price sched- 
ule for the retailer to propose: 

Proposition 1 The optimal (non-linear) payment schedule for the principal 
induces two possible outcomes depending upon the state of nature such that: 

• the supplier trades the ex post efficient quantity, x^, when it is an efficient 
producer, but trades less than the efficient quantity when it is an inefficient 
producer (i.e., x*j(f) < xf); 

• an inefficient supplier makes no profit (uj — 0), but an efficient supplier 
earns an information rent of R[x}(f)]; 

• the revelation constraint is binding in state E, slack in state I; 

• the participation constraint is binding in state I , slack in state E; 

• x*j(f) and R[x*j(f)] are non- decreasing in the probability of drawing an 
inefficient producer (i.e., are non- decreasing in f ); 

• and, finally, lim /io x}(/) = 0, \im fn x*j(f) = xf , lim fl0 R[xj(f)] = 0, 
and lim /n R [x}(f)} = R(xf). 



5 Since xf, > 0, this is the relevant domain of output to consider. 
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To see that the last two points hold, note first that the effective marginal cost 
of production from an inefficient supplier, 

C' i (x) + ^Ir'(x), 

is falling in /. By the usual comparative statics, this means that x*j (/) is non- 
decreasing. Since R(-) is an increasing function, R[x}(f)] must be similarly 
non-decreasing. As / J, 0, this effective marginal cost tends to +00 for x > 0, 
which means the optimal level of trade falls to zero. As / | 1, this effective 
marginal cost tends to the symmetric-information marginal cost, hence x*j (/) 
tends to the symmetric-information level, xf . 

Intuition for these results can be gained from Figure 1. This figure shows 
one indifference curve for an inefficient (type-/) supplier and three indifference 
curves for an efficient (type-/?) supplier in output-payment space. The type-/ 
indifference curve is that type's zero-profit curve (hence, by necessity, it passes 
through the origin). Correspondingly, the lowest and darkest of the type-E 
indifference curves is that type's zero-profit curve. The faint dash-dot lines 
are iso- profit curves for the retailer (to minimize clutter in the figure, they're 
sketched as straight lines, but this is not critical for what follows). Observe 
that an iso-profit curve is tangent to type-/'s zero-profit indifference curve at 
point A. Likewise, we have similar tangency for type-/? at point B. Hence, 
under symmetric information, points A and B would be the contracts offered. 
Under asymmetric information, however, contract B is not incentive compatible 
for type-/?: Were it to lie and claim to be type-/ (i.e., move to point A), 
then it would be on a higher (more profitable) indifference curve (the highest 
of its three curves). Under asymmetric information, an incentive compatible 
pair of contracts that induce the symmetric-information levels of trade are A 
and C. The problem with this solution, however, is that type-/? earns a large 
information rent, equal to the distance between B and C. The retailer can reduce 
this rent by distorting downward the quantity asked from a type-/ supplier. For 
example, by lowering quantity to Xj(f), the retailer significantly reduces the 
information rent (it's now the distance between B and E). How much distortion 
in quantity the retailer will impose depends on the likelihood of the two types. 
When / is small, the expected savings in information rent is large, while the 
expected cost of too-little output is small, so the downward distortion in type- 
/'s output is big. Conversely, when / is large, the expected savings are small 
and the expected cost is large, so the downward distortion is small. The exact 
location of point D is determined by finding where the expected marginal cost 
of distorting type-/'s output, / x [r 1 (xi) — C\ (xj)], just equals the expected 
marginal reduction in type-ZS's information rent, (1 — /) x R' (xi). 

From Figure 1, it is clear that the retailer loses from being uninformed about 
the supplier's type: Point D lies on a worse iso-profit curve than does point A 
and point E lies on a worse iso-profit curve than does point B. 6 Put another 
way, if the retailer draws a type-/ supplier, then it gets a non-optimal (relative 



Since the retailer likes more output (in the relevant range) and smaller payments to the 
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Figure 1: The symmetric-information contracts, points A and B, are not incentive 
compatible. The symmetric-information quantities, and xf , are too 
expensive because of the information rent (the distance from B to C). 
Consequently, with asymmetric information, the principal trades off a 
distortion in type-/'s output (from xf to x*j(f)) to reduce type-S's 
information rent (from BC to BE). 



to symmetric information) quantity of the good. While if it draws a type-E 
supplier, then it pays more for the optimal quantity (again, relative to symmetric 
information). Part — but only part — of the retailer's loss is the supplier's gain. 
In expectation, the supplier's profit has increased by (1 — /) R [x} (/)]. But 
part of the retailer's loss is also deadweight loss: Trading x*j (/) units instead of 
xf units is simply inefficient. In essence, our retailer is a monopsonist and, as 
is typical of monopsony, there is a deadweight loss. Moreover, the deadweight 
loss arises here for precisely the same reason it occurs in monopsony: Like a 
monopsonist, our retailer is asking the payment (price) schedule to play two 
roles. First, it asking it to allocate goods and, second, it is asking it to preserve 
its rents from having all the bargaining power. Since only the first role has 
anything to do with allocativc efficiency, giving weight to the second role can 
only create allocative inefficiency. As often happens in economics, a decision 
maker has one instrument — here, the payment schedule — but is asking it to 
serve multiple roles. Not surprisingly then, the ultimate outcome is less than 
first best. 

Since the first best is not achieved, it is natural to ask whether the retailer 



supplier, its utility (profits) are greater on iso-profit curves toward the southeast of the figure 
and less on iso-profit curves toward the northwest. 
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could improve on the outcome in Proposition 1 by using some more sophisti- 
cated contract? The answer is no and the proof is, as we will see later, quite 
general. Whatever sophisticated contract the retailer uses, this contract will 
boil down to a pair of points, (xi, sj) and (xe, Se), once it is executed; that is, 
a final quantity traded and a final payment for each possible state of nature. 
Consequently, whatever complicated play is induced by the contract, both par- 
ties can see through it and forecast that the equilibrium outcomes correspond 
to these two points. Moreover, by mimicking the strategy it would play in state 
t, the supplier can generate either of the two outcomes regardless of the true 
state. In addition, if it can't profit from (x tl s t ) in state t, it can simply not par- 
ticipate. Necessary equilibrium conditions are, then, that the supplier choose 
(x t ,s t ) in state t rather than (xt>,s t >), t ^ t' , and that it choose to participate 
anticipating that it will choose (x t ,St) in state t. But these are precisely the 
revelation and participation constraints (2), (3), (6), and (7). Therefore, what- 
ever the contractual arrangement, the final outcome can always be generated by 
a simple (non-linear) payment schedule like the one derived above. We've, thus, 
established that the outcome described in Proposition 1 cannot be improved on 
by using more sophisticated or alternative contracts. 7 

Finally, note that we don't need an entire payment schedule, S (•). In partic- 
ular, there is a well-known alternative: a direct-revelation contract (mechanism), direct revelation 
In a direct-revelation contract, the retailer commits to pay the supplier se for 
xe or si for xi depending on the supplier's announcement of its type. Failure 
by the supplier to announce its type (i.e., failure to announce a t € {E,I}) is 
equivalent to the supplier rejecting the contract. Finally, if, after announcing 
its type, the supplier produces a quantity other than x t ~, the supplier is pun- 
ished (e.g., paid nothing). It is immediate that this direct-revelation contract is 
equivalent to the optimal payment schedule derived above. It is also simpler, in 
that it only deals with the relevant part of the payment schedule. Admittedly, 
it is not terribly realistic, 8 but as this discussion suggests we can transform a 
direct-revelation contract into a more realistic contract (indeed, we will formal- 
ize this below in Proposition 3). More importantly, as we will see, in terms of 
determining what is the optimal feasible outcome, there is no loss of generality 
in restricting attention to direct-revelation contracts. 

4 General Screening Framework 

The two- type screening model yielded strong results. But buried within it is a 
lot of structure and some restrictive assumptions. If we are really to use the 
screening model to understand economic relationships, we need to deepen our 
understanding of the phenomena it unveils, the assumptions they require, and 

7 This is not to say that another contract couldn't do as well. This is rather obvious: For 
instance, suppose S (x) = for all x except xp and X[, where it equals s^ors;, respectively. 
We've merely established that no other contract can do strictly better than the S (•) derived 
in the text. 

8 Although see Gonik (1978) for a real-life example of a direct-revelation mechanism. 
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the robustness of its conclusions. Our approach in this section is, thus, to start 
from a very general formalization of the problem and to motivate or discuss the 
assumptions necessary for making this model "work." 

A principal and an agent are involved in a relationship that can be charac- 
terized by an allocation x € X and a real-valued monetary transfer sGcScl allocation 
between the two players. A transfer-allocation pair, (x, s), is called an outcome, outcome 
The space of possible allocations, X, can be quite general: Typically, as in our 
analysis of the retailer-supplier problem, it's a subspace of M; but it could be 
another space, even a multi-dimensional one. In what follows, we assume that 
outcomes are verifiable. 

The agent's information is characterized by a parameter 9 G O. As before, 
we'll refer to this information as the agent's type. The type space, 0, can be very type space 
general. Typically, however, it is cither a discrete set (e.g., as in the retailer- 
supplier example where we had = {/, E}) or a compact interval in R. Nature 
draws 9 from according to a commonly known probability distribution. While 
the agent learns the value of 9 perfectly, the principal only knows that it was 
drawn from the commonly known probability distribution. 

Both players' preferences are described by von Ncumann-Morgenstern utility 
functions, W(x,s,9) for the principal and Li(x,s,9) for the agent, where both 
are defined over ^x5x6. Since we interpret s as a transfer from principal to 
agent, we assume that U increases in s, while W decreases in s. For convenience, 
we assume these utility functions are smooth; more precisely, that they are 
three-times continuously diffcrcntiable. 9 

To have a fully general treatment, we need to extend this analysis to allow 
for the possibility that the actual outcome is chosen randomly from X x S. To 
this end, let tr denote a generic element of the set of probability distributions, 
A(X x S), over the set of possible outcomes, X x S. We extend the utility 
functions to A (X x S) through the expectation operator: 

W(a, 9) = IE (j [W(x, s, 9)} and U{a, 9) = E ff [U{x, s, 9)] . 

By adding an element to X if necessary, we assume that there exists a no- 
trade outcome (xo,0); that is, (xo,0) is the outcome if no agreement is reached 
(e.g., in the retailer-supplier example this was (0,0)). The values of both W 
and U at this no-agreement point play an important role in what follows, so we 
give them special notation: 

W R (9) = W (x , 0, 9) and U R (9) = U (x , 0, 9) . 

These will be referred to as the Reservation utilities of the players (alternatively, 
their individual rationality payoffs). Obviously, an agent of type 9 accepts a 
contract if and only if his utility from doing is not less than Ur(&). 

It is convenient, at this stage, to offer a formal and general definition of what 
a contract is: 



9 One could relax this smoothness assumption, but the economic value of doing so is too 
small to warrant time on this issue. 
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Table 1: The Retailer-Supplier Example in our General Notation 



Description 


General Notation 


Specific Value 


Allocation space 


X 


M + 


Transfer space 


s 


K 


Outcome function 


a 


All mass on (x, S (x)) 


Principal's strategy space 


M 


{pay S 0)} 


Agent's strategy space 


M 


X = M + 



Definition 1 A contract in the static contractual screening model is a game 
form, (A4, J\f,cr), to be played by the principal and the agent, M. denotes the 
agent's strategy set, Af the principal's strategy set, and a an outcome function 
that maps any pair of strategies (m,n) to a probability mapping on X xS. That 
is, a : M x Af —> A (X x S). 

To make this apparatus somewhat more intuitive, consider Table 1, which 
"translates" our retailer-supplier example into this more general framework. 
Observe that, in the example, the contract fixes a trivial strategy space for the 
principal: She has no discretion, she simply pays S(x). 10 Moreover, there is 
no randomization in that example: a simply assigns probability one to the pair 
(x, S (x)). As this discussion suggests, generality in notation need not facilitate 
understanding — the generality contained here is typically greater than we need. 
Fortunately, we will be able to jettison much of it shortly. 

A direct mechanism is a mechanism in which A4 = O; that is, the agent's direct mechanism 
action is limited to making announcements about his type. The physical con- 
sequences of this announcement are then built into the outcome function, a. 
For instance, as we saw at the end of the previous section, we can translate our 
retailer-supplier contract into a direct mechanism: Now AA = {E,I}, Af is a 
singleton (so we can drop n as an argument of a), and 

rr^-l {XI ' Sl) = (X ' (/) ' Cl [X *t (/)]) if m = 1 

1 j I {x E ,s E ) = {x F E ,R[x*U)]+C E (x F E )) i£m = E ' 

A direct-revelation mechanism (alternatively, a direct truthful mechanism) is a direct-revelation 
direct mechanism where it is an equilibrium strategy for the agent to tell the 
truth: Hence, if m(-) : O — > O is the agent's strategy, we have m{9) = 9 in 
equilibrium for all 9 e O. That is, for any 9 and 9' in 0, 

U{a{m [9]),9)>U(a(m [9']), 9). 

Note that not every direct mechanism will be a direct-revelation mechanism. 
Being truthful in equilibrium is a property of a mechanism; that is, it depends 
on a (•). 



10 We could, alternatively, expand her strategy space to S — she can attempt to pay the agent 
whatever she wants. But, then, the outcome function would have to contain a punishment for 
not paying the agent appropriately: That is, a (x, s) = (x, S (x)) if s = S (x) and equals (x, oo) 
if s ^ S (x) (where oo is shorthand for some large transfer sufficient to deter the principal 
from not making the correct payment). 
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Observe that the design of a contract means choosing A4, Af, and a. In 
theory, the class of spaces and outcome functions is incomprehensibly large. 
How can we find the optimal contract in such a large class? Indeed, given the 
inherent difficulties in even characterizing such a large class, how can we ever 
be sure that we've found the optimal contract? Fortunately, two simple, yet 
subtle, results — the revelation principle and the taxation principle — allow us 
to avoid these difficulties. 11 From the revelation principle, the search for on 
optimal contract reduces without loss of generality to the search for the optimal 
direct-revelation mechanism. Moreover, if the outcome in the direct-revelation 
mechanism is a deterministic function of the agent's announcement, then, from 
the taxation principle, we may further restrict attention to a payment schedule 
that is a function of the allocation x (as we did in the retailer-supplier example) . 

Proposition 2 (The revelation principle) 12 For any general contract (A4, 
Af, a) and associated Bayesian equilibrium, there exists a direct- rev elation mech- 
anism such that the associated truthful Bayesian equilibrium generates the same 
equilibrium outcome as the general contract. 

Proof: The proof of the revelation principle is standard but informative. A 
Bayesian equilibrium of the game (A4,N, a) is a pair of strategics (m(-),n). 13 
Let us consider the following direct mechanism: cr(-) = <r(m(-), n). Our claim is 
that cr(-) induces truth-telling (is a direct- revelation mechanism). To see this, 
suppose it were not true. Then there must exist a type such that the agent 
does better to lie — announce some 0' ^ 9 — when he is type 9. Formally, there 
must exist 9 and 9' ^ 9 such that 

U(a(6'),9) >U{a{9),9). 

Using the definition of cr(-), this means that 

U{a [m{9'),n] , 6) > U{a [to (0) , n] , 0); 

but this means the agent prefers to play m(0') instead of to(0) in the original 
mechanism against the principal's equilibrium strategy n. This, however, can't 
be since m(-) is an equilibrium best response to n in the original game. Hence, 
truthful revelation must be an optimal strategy for the agent under the con- 
structed direct mechanism. Finally, when the agent truthfully reports the state 
of nature in the direct truthful mechanism, the same outcome a (6) = a(m(9),n) 



11 lt is unfortunate that these two fundamental results are called principles, since they 
are not, as their names might suggest, premises or hypotheses. They are, as we will show, 
deductive results. 

12 The revelation principle is often attributed to Myerson (1979), although Gibbard (1973) 
and Green and Laffont (1977) could be identified as earlier derivations. Suffice it to say 
that the revelation principle has been independently derived a number of times and was a 
well-known result before it received its name. 

13 Observe that the agent's strategy can be conditioned on 8, which he knows, while the 
principal's cannot be (since she is ignorant of 0). 
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is implemented in equilibrium. ■ 

An intuitive way to see the revelation principle is imagine that before he 
plays some general mechanism, the agent could delegate his play to some trust- 
worthy third party. There are two equivalent ways this delegation could work. 
One, the agent could tell the third party to play to. Alternatively, if the third 
party knows the agent's equilibrium strategy — the mapping to : O — > M. — then 
the agent could simply reveal (announce) his type to the third party with the 
understanding that the third party would choose the appropriate actions, to (9). 
But, since we can build this third party into the design of our direct-revelation 
mechanism, this equivalence means that there is no loss of generality in restrict- 
ing attention to direct-revelation mechanisms. 

The taxation principle requires a little more structure: It assumes that there 
is a possibility of punishing the agent so as to deter him to violate the contractual 
rules. More specifically, let us consider the following assumption: 

AO (Existence of a punishment): There exists an s el such that: 

sup U(x,s,9) < inf U(x,s,9). 
(x.e)exxe (s,x,0)exxSxe 



In other words, there exists a punishment so severe that the agent would always 
prefer not to suffer it. 

With this assumption, one can construct a payment schedule (in R) that 
generates the same outcome as any deterministic direct-revelation mechanism 
and is, therefore, as general as any contract in this context. 

Proposition 3 (The taxation principle) Under Assumption AO, the equi- 
librium outcome under any deterministic direct-revelation mechanism, a (•) = 
(x(-), s (•)), is also an equilibrium outcome of the game where the principal pro- 
poses the payment schedule S(-) defined by 

S(x) = s(9), when 9 € x^ 1 (x) (i.e., such that x = x(9) for some 9^0) 
S(x) = s, otherwise. 

Proof: Let's first establish that S (•) is unambiguously defined: Suppose there 
existed 9\ and 9 2 such that 

x = x(9i) = x(9 2 ), 

but s(9i) ^ s(9 2 ). We're then free to suppose that s(9i) > s(9 2 )- Then, because 
U is increasing in s, 

U{x{9 1 ),s{9 1 ),9 2 ) > U{x{9 2 ),s{9 2 ),9 2 ). 

But this means the agent would prefer pretending that the state of nature 
is 9 2 when it's actually 6i; the mechanism would not be truthful. Hence, if 
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#1,02 € x^ 1 (x), we must have s(0i) = s(0 2 ); that is, the payment schedule 
S(-) is unambiguously defined. 

Now, the agent's problem when faced with the payment schedule S(-) is sim- 
ply to choose the allocation x that maximizes U(x, S(x), 0). Given the severity 
of the punishment, the agent will restrict his choice to x <G x(O). But since our 
original mechanism was a direct-revelation mechanism, we know 

U [x (9) ,s(9),9)>U(x (9') , s (9') , 0) 

for all 9 and 9' . So no type 9 can do better than to choose x = x (0). ■ 

The economic meaning of the taxation principle is straightforward: When 
designing a contract, the principal is effectively free to focus on "realistic" com- 
pensation mechanisms that pay the agent according to his achievements. Hence, 
as we argued above, there is no loss of generality in our solution to the retailer- 
supplier problem. 

Although payment schedules involve no loss of generality and are realistic, 
the fact that they are often nonlinear means that they can be difficult to work 
with. 14 In particular, when looking for the optimal non- linear price schedule, 
one must be able to compute the functional mapping that associates to each 
schedule S(-) the action choice x{9) that maximizes U(x, S(x),9). Even assum- 
ing S (•) is differentiablc — which is not ideal because one's not supposed to make 
assumptions about endogenous variables — solving this problem can be difficult. 
Direct-revelation mechanisms, on the other hand, allow an easier mathemati- 
cal treatment of the problem using standard convex analysis: The revelation 
constraints simply consist of writing 9' — 9 is a maximum of U(x(9'),s(9'),9) 
and writing that a given point is a maximum is easier that characterizing an 
unknown maximum. For this reason, much of the mechanism-design literature 
has focussed on direct-revelation mechanisms over optimal payment schedules 
despite the latter's greater realism. 

5 The Standard Framework 

As we've already hinted, the general framework introduced in the previous sec- 
tion is more general than what is commonly used. In this section, we introduce 
a "standard" framework, within which much of the contractual screening liter- 
ature can be placed. We begin by defining this framework and contrasting it to 
the more general framework introduced above. At the end of this reading, we 
offer less conventional views on the screening model, which require departing 
from the standard framework. In order to provide a road map to the reader, we 
spell out each assumption in this section and suggest what consequences would 
arise from alternative assumptions. 



14 Admittcdly, if the derived payment schedule is sufficiently nonlinear, one could begin to 
question its realism, as real-world contracts are often linear or, at worst, piecewise linear. 
Although see footnote 8 for an example of where reality "matches" theory. 
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In the standard framework, the allocation space, X , is M + . 15 The type space, 
6, is [9l, Oh] C K, where both bounds, 9l and Oh, are finite. 16 The most critical 
assumptions in going from the general framework to the standard framework in- 
volve the utility functions. Henceforth, we assume they are additively separable 
in the transfer and the allocation. Moreover, we assume the marginal value of 
money is type independent (i.e., d(dU/ds) /d9 = d(dW/ds) jd9 = 0). These 
assumptions simplify the analysis by eliminating income effects from consider- 
ation. Formally, the agent's and principal's utility functions are 

U(x,s,6) = s + u(x, 9) and 
W(x,s,0) = w(x,0)-8, 

respectively. Essentially for convenience, we take u(-, •) and w (•, •) to be three- 
times continuously differentiablc. The aggregate (full-information) surplus is 
defined as: 

fl(x,6) = w(x,0) +u(x,6). 
We also assume, mainly for convenience, that 

1. Some trade is desirable: For all 6 e {0 L ,0 H ], dfl (0, 6) /dx > 0. 

2. There can be too much of a good thing: For all 6 € [0l,0h] 3x (8) 
such that n (x, 6) < Ct (0, 9) for all x > x (9). 

Observe these two assumptions entail that f2 (x, 9) has an interior maximum for 
all 9 e (6l,@h]- If the first assumption didn't hold for at least some types, then 
trade — contracting — would be pointless. Extending the desirability of trade to 
almost all types saves from the bookkeeping headache of distinquishing between 
types with which trade is efficient and those with which it is not. The second 
assumption is just one of many ways of expressing the sensible economic idea 
that, beyond some point, welfare is reduced by trading more. 

Observe that the standard framework is restrictive in several potential im- 
portant ways: 

• The type space is restricted to be one-dimensional. In many applica- 
tions, such as the retailer-supplier example, this is a natural assumption. 
One can, however, conceive of applications where it doesn't fit: Suppose, 
e.g., the retailer cared about the quantity and quality of the goods re- 
ceived and the supplier's type varied on both an efficiency dimension and 
a conscientiousness-of-employees dimension (the latter affecting the cost 
of providing quality). Not surprisingly, restricting attention to one di- 
mension is done for analytic tract ability: Assuming a single dimension 



15 That the space be bounded below at is not critical — any lower bound would do. Alter- 
natively, by appropriate changes to the utility functions, we could allow the allocation space 
to be unbounded. Zero is simply a convenience. 

16 This constitutes no loss of economic (as opposed to mathematical) generality, since we 
can set the bounds as far apart as we need. 
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make the order properties of the type space straightforward (i.e., greater 
and less than are well-defined on the real line). As we will see, the order 
properties of the type space are critical to our analysis. 

• The set of possible allocations is one-dimensional. Again, this is sufficient 
for some applications (e.g., the quantity supplied to the retailer), but not 
others (e.g., when the retailer cares about both quantity and quality). 
The difficulty in expanding to more than one dimension arise from diffi- 
culties in capturing how the agent's willingness to make tradeoffs among 
the dimensions (including his income) varies with his type. The reader 
interested in this extension should consult Rochet and Chone (1998). 

• As noted, the utility functions are separable in money and allocation; the 
marginal utility of income is independent of the state of nature; and the 
marginal utility of income is constant, which means both players are risk 
neutral with respect to gambles over money. The gains from these as- 
sumptions are that we can compute the transfer function s(-) in terms of 
the allocation function x(-), which means our optimization problem is a 
standard optimal-control problem with a unique control, x(-). In addition, 
risk neutrality insulates us from problems that exogenously imposed risk 
might otherwise create (e.g., the need to worry about mutual insurance). 
On the other hand, when the agent is risk averse, the ability to threaten 
him with endogenously imposed risk (from the contract itself) can provide 
the principal an additional tool with which to improve the ultimate allo- 
cation. For a discussion of some of these issues sec Edlin and Hcrmalin 
(1997). Note we still have the flexibility to endogenously impose risk over 
the allocation (the x), we discuss the desirability of doing so below. See 
also Maskin (1981). 

Continuing with our development of the standard framework, we assume 
that nature chooses the agent's type, 9, according to the distribution function 
F(-) : [9l,0h] — ► [0,1]- Let /(•) be the associated density function, which 
we assume to be continuous and to have full support (i.e., f (9) > for all 
()€ [9 L , 9 H \). Assuming a continuum of types and a distribution without mass 
points is done largely for convenience. It also generalizes our analysis from just 
two types. Admittedly, we could have generalized beyond two types by allowing 
for a finite number of types greater than two. The conclusions we would reach 
by doing so would be economically similar to those we'll shortly obtain with a 
continuum of types. 1 The benefit of going all the way to the continuum is it 
allows us to employ calculus, which streamlines the analysis. 

Recall that, at its most general, a direct-revelation mechanism is a mapping 
from the type space into a distribution over outcomes. Given the way that 
money enters both players utility functions, we're free to replace a distribution 
over payments with an expected payment, which means we're free to assume that 



See, e.g., Caillaud and Hermalin (1993), particular §3, for a finite-type-space analysis 
under the standard framework. 
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the payment is fixed deterministically by the agent's announcement. 18 What 
about random- allocation mechanisms? The answer depends on the risk proper- 
ties of the two players' utilities over allocation. If we postulate that w(-,9) and 
u(-,9) are concave for all 9 € [9l,9h], then, absent incentive concerns, there 
would be no reason for the principal to prefer a random-allocation mechanism; 
indeed, if at least one is strictly concave (i.e., concave, but not affine), then 
she would strictly prefer not to employ a random-allocation mechanism absent 
incentive concerns: Her expected utility is greater with a deterministic mecha- 
nism and, since the agent's expected utility is greater, her payment to him will 
be less (a benefit to her). Hence, we would only expect to see random-allocation 
mechanisms if the randomness somehow relaxed the incentive concerns. Where 
does this leave us? At this point, consistent with what is standardly done, we 
will assume that both w(-,9) and u(-,9) are concave, with one at least being 
strictly concave, for all 9 e [9l,9h] (note this entails that ft (-,9), the social 
surplus function, is also strictly concave). Hence, absent incentive concerns, 
we'd be free to ignore random-allocation mechanisms. For the time being, we'll 
also ignore random- allocation mechanisms with incentive concerns. Later, we'll 
consider the circumstances under which this is appropriate. Since we're ignoring 
random mechanisms, we'll henceforth write (x(-),s(-)) instead of a (•) for the 
mechanism. 

Within this framework, the route that we follow consists of two steps. First, 
we will characterize the set of direct-revelation contracts; that is, the set of 
contracts (x(-),s(-)) from [9l,9h] to R x R + that satisfy truthful revelation. 
This truthful-revelation — or incentive compability — condition can be expressed 
as: 

s(9) + u[x(9),9}> s(9) + u x(9),9 (11) 

for all (9,9) £ [0l,9h] 2 - After completing this first step, the second step is 
identifying from within this set of incentive-compatible contracts the one that 
maximizes the principal's expected utility subject to the agent's participation. 
Whether the agent participates depends on whether his equilibrium utility ex- 
ceeds his reservation utility, Ur(9). Observe that the requirement that the 
agent accept the contract imposes an additional constraint on the principal 
in designing the optimal contract. As before, we refer to this constraint as 
the participation or individual-rationality (IR) constraint. Recall that X is as- 
sumed to contain a no-trade allocation, xo, s — is a feasible transfer, and 
Ur(9) =U (xo, 0,9). Hence there is no loss of generality in requiring the prin- 
cipal to offer a contract that is individually rational for all types (although the 
"contract" for some types might be no trade). 

We assume that the agent acts in the principal's interest when he's otherwise 
indifferent. In particular, he accepts a contract when he is indifferent between 
accepting and rejecting it and he tells the truth when indifferent between being 
honest and lying. This is simply a necessary condition for there to exist an 



18 That is, the mechanism that maps 9 to a distribution G (9) over payments is equivalent 
to a mechanism that maps 9 to the deterministic payment s (9) = {s}. 



Caillaud and Hcrmalin The Standard Framework 



20 



cquilibrum and, as such, should not be deemed controversial. 

In our treatment of the retailer-supplier example, we assumed that both 
types of supplier had the same reservation utility (i.e., recall, Ur (0) = U (0, 0, 0) = 
—C'e (0) = 0). It is possible, however, to imagine models in which the reserva- 
tion utility varies with 9. For instance, suppose that an efficient supplier could, 
if not employed by the retailer, market its goods directly to the ultimate con- 
sumers (although, presumably, not as well as the retailer could). Suppose, in 
fact, it would earn a profit of tte > from direct marketing. Then we would 
have Ur(E) = tte and Ur(I) = 0. A number of authors (see, e.g., Lewis 
and Sappington, 1989, Maggi and Rodriguez-Clare, 1995, and Jullien, 1996) 
have recently studied the role of such type- dependent reservation utilities in 
contractual screening models. Type dependence can, however, greatly compli- 
cate the analysis. We will, therefore, adopt the more standard assumption of 
type-independent reservation utilities; that is, we assume — as we did in our 
retailer-supplier example — that 

U R {9) = U R and W R {9) = W R 

for all 9 £ [9l,9h\- As a further convenience, we will interpret x — as the 
no-trade allocation. Observe that these last two assumptions imply 

u (0, 6) = u (0, 6') = U R and 

to (0,0) = w{o,e') = w R 

for all 9, 9' e [0l,9h]- Given these assumptions, we can express the agent's 
participation constraint as 

s{9)+u[x{9),9]>U R . (12) 

Although our treatment of reservation utilities and no trade is standard, we 
have, nevertheless added to the assumptions underlying the standard framework. 

At last, we can state the problem that we seek to solve: Find the optimal 
contract (x (•) , s (•)) that maximizes 

f" (w[x(9),9]- s (9))f(9)d9 (13) 

subject to (11) and (12) holding. 

Before solving this program, it's valuable to consider the full (symmetric) 
information benchmark: Ex post efficiency corresponds to adopting the alloca- 
tion 

x F (9) € arg max fl(x, 9) 

for each 9 (recall f2(x, 0) is the aggregate surplus from the relationship). Our 
earlier assumptions ensure that fi(-,0) is strictly concave for each 0; so the ex 
post efficient allocation is uniquely defined by the first-order condition: 

|^.«-£(^ + £(*'(»>,»>-o. 
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Our earlier assumptions also entail that x F '(•) is uniformly bounded from above. 
Observe that any sharing of the surplus can, then, be realized by the appropriate 
transfer function. 19 It follows that, if the principal knows 9 — that is, the parties 
are playing under full (symmetric) information then the contracting game can 
be easily solved: In equilibrium, the principal offers a contract (x F (-), s F (-)) 
such that the agent's utility is exactly equal to his reservation utility; i.e., 

s F {9) = U R -u{x F {9),9). 

In other words, the principal captures the entire surplus — a consequence of 
endowing her with all the bargaining power — leaving the agent at his outside 
(non-participation) option. 

5.1 The Spence-Mirrlees Assumption 

Before we proceed to solve (13), we need to introduce one more assumption. 
Given this assumption's importance, it is worth devoting a short section to it. 

In order to screen types, the principal must be able to exploit differences 
across the tradeoffs that different types are willing to make between money 
and allocation. Otherwise a strategy, for instance, of decreasing the x expected 
from the agent in exchange for slightly less pay wouldn't work to induce one 
type to reveal himself to be different than another type. Recall, for instance, 
because the marginal cost of output differed between the efficient and inefficient 
types in our retailer-supplier example, we (the buyer) could design a contract 
to induce revelation. Different willingness to make tradeoffs means we require 
that the indifference curves of the agent in x-s space differ with his type. In 
fact, we want, for any point in x-s space, that these slopes vary monotonically 
with whatever natural order applies to the type space. Or, when, no natural 
order applies, we want it to be possible to define an order, >-, over the types so 
that 9 >~ 9' if and only if the slope 9's indifference curve is greater (alternatively 
less) than the slope of 9"s indifference at every point (x, s) e X x S. Such 
a monotonicity-of-indifference-curves condition is known as a Spence-Mirrless 
condition and, correspondingly, the assumption that this condition is met is Spence-Mirrlees 
known as the Spence-Mirrlees assumption. 

The slope of the indifference curve in x-s space is equal to —du/dx. Hence, 
we require that —du/dx or, equivalently and more naturally, du/dx vary mono- 
tonically in 9. Specifically, we assume: 

Al (Spence-Mirrlees Assumption): For all x £ X, 

du {x, 9) du {x, 9') 
dx dx 



19 Note that this is an instance of where we're exploiting the additive separability (lack of 
income effects) assumed of the utility functions. Were there income effects, we couldn't define 
x F (•) independently of the transfers. 
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if 9 > 9' (note 9 C R) 20 

That is, if > 9' — 9 is a higher type than 9' — then —1 times the slope of type 
#'s indiffernce curve is, at any point, greater than —1 times the slope of type 
9 n s indifference curve. Observe that a consequence of Assumption Al is that 
a given indiffence curve for one type can cross a given indifference curve of 
another type at most once. For this reason, Assumption Al is sometimes called 
a single-crossing condition. Figure 2 illustrates. single-crossing 

If, as assumed in the standard framework, 6 = [6l,&h] and u(-, •) is three- 
times differentiablc, then Assumption Al is equivalent to 

Al (Standard framework Spence-Mirrlees Assumption): For all (x, 9) e 

R + x[6 L ,6 H }, 

d 2 u(x,9) 

d9dx 



Economically, the Spence-Mirrlees assumption tells us that the agent's mar- 
ginal benefit from increasing x is increasing in his type. As an example of all 
this, recall our retailer-supplier model. There, the efficient, E, type was the 
higher type (i.e., E >~ I). Recall too that u(x,9) = —Ce(x). Recall as well 
our assumption (definition) that the efficient type had the lower marginal cost. 
Putting this all together, we see that Assumption Al holds for retailer-supplier 
model. 

It is important to understand that the Spence-Mirrlees assumption is an 
assumption about order. Consequently, differentiability of u with respect to 
cither x or 9 is not necessary. Nor, in fact, is it necessary that U be additively 
separable as we've been assuming. At its most general, then, we can state the 
Spence-Mirrlees assumption as 

Al' (General Spence-Mirrlees Assumption): There exists an order on 
6 such that if 9' y g 9" , then 

U {x\ s', 9") > U (x", a", 9") =► U {x', a', 9') > U (x", a", 9') , 

whenever x' y x x" (where a', a" <G S and y x completely orders X). 



20 Thc assumption that 9 C 1 is not critical. For suppose that i 6 T were some natural or 
intuitive definition of type, where T had no natural order or was ordered by something other 
than >. Suppose that for any pairs of types t and f'eT that 

( du(x,t) du{x,t')\ ^ (du{x',t) du{x',t'Y _ 



V dx dx J \ dx dx 

for any pair of allocations x and x' 6 X. Then observe that by picking a specific x, we can 
define 

ox 

This new type space, Q = 8 (T) C R is thus isomorphic to our original type space plus which 
it meets the Spence-Mirrlees assumption. 
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Indifference curve 

for high type 

Indifference curve 
S for low type 




x 



Figure 2: The Spence-Mirrlees Assumption: Through any point (e.g., A or B), 
the indifference curve through that point for the high type cross the 
indifference curve through that point for the low point from above. 



This generalized Spence-Mirrlees assumption states that we can order the types 
so that if a low type (under this order) prefers, at least weakly, an outcome with 
more x (with "more" being defined by the order y x ) than a second outcome, 
then a higher type must strictly prefer the first outcome to the second. Figure 
2 illustrates: Since the low type prefers point C to A (weakly), the high type 
must strictly prefer C to A, which the figure confirms. Similarly, since the 
low type prefers C to B (strictly), the high type must also strictly prefer C to 
B, which the figure likewise confirms. 21 See Milgrom and Shannon (1994) for 



21 Observe, as shown in Figure 2, that the area above the higher type's indifference curve 
to the right of given point (A or B) is larger than the area above the lower type's indifference 
curve to the right of that point. This suggests an alternative statement of Al'. For a given 
point (xo,so), define Tio = {(x,s) \x ^ x xo} (i.e., Tio is the right half-plane defined by the 
vertical line x = xq) and define 

Vo (0) = {(x,s) \U(s,x,0) >U(s o ,x o ,0)} 

(i.e., Vo (0) are the outcomes preferred by a type-0 agent to (xq, so))- Then Al' is equivalent 
to 

v (<?') n Ho c Vo (e) n n 

for any point (xo,sq) when yg 0'. 
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a more complete discussion of the relationship between Assumption Al' and 
Assumption Al. 

As suggested at beginning of this sub-section, the consequence of the Spence- 
Mirrlees assumption (stated either as Al or Al') is that it is possible to separate 
any two types; by which we mean it is possible to find two outcomes (x\, s\) and 
(x 2 , s 2 ) such that a type-0i agent prefers (xi, si) to (x 2 , s 2 ), but a type-# 2 agent 
has the opposite preferences. For instance, in Figure 2, let point A be (xi,si) 
and let D be (x 2 ,s 2 ). If #2 is the high type and 9\ is the low type, then it is 
clear that given the choice between A and D, 9\ would select A and # 2 would 
select D; that is, this pair of contracts separates the two types. Or, for example, 
back in Figure 1, contracts D and E separate the inefficient and efficient types 
of supplier. 

5.2 Characterizing the Incentive-Compatible Contracts 

Our approach to solving the principal's problem (13) is a two-step one. First, 
we will find a convenient characterization of the set of incentive-compatible 
mechanisms (i.e., those that satisfy (11)). This is our objective here. Later, we 
will search from within this set for those that maximize (13) subject to (12). 

Within the standard framework it is relatively straightforward to derive the 
necessary conditions implied by the self-selection constraints. Our approach 
is standard (see, e.g., Myerson, 1979, among others). Consider any direct- 
revelation mechanism (x(-),s(-)) and consider any pair of types, B\ and 62, 
with 61 < 8 2 . Direct revelation implies, among other things, that type Q\ won't 
wish to pretend to be type 02 and vice versa. Hence, 

s(0i)+u[a;(0i),0i] > s(e 2 )+u[x(e 2 ),e 1 } and 
s(9 2 )+u[x(02),e2] > s(6 1 )+u[x(6 1 ),9 2 ]. 

As is often the case in contract theory, it is easier to work with utilities than 
payments. To this end, define 

v(O) = 8(O) + u[x(6),0\. 

Observe that v (0) is the type-# agent's equilibrium utility. The above pair of 
inequalities can then be written as: 

> v(6 2 ) - u[x(0 2 ),6> 2 ] +u[x(6 2 ),e 1 ] and 

v{e 2 ) > v{e 1 )-u[x{e 1 ),e 1 ] + u[x{e 1 ),e 2 ]. 

Or, combining these two inequalities, as 

£ ! ^^ M <v W -^ ) <£ 9 ^^^ ,14) 
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This double inequality has two consequences. First, ignoring the middle 
term, it implies 

/ / ^-^(x,0)dxd0>O. 

J Ox Jx{Px) dxd0 

The Spence-Mirrlees assumption, Al, means the integrand is positive. Given 
9i < 6*2, this means the integral can be non-negative only if x{6i) < x(9 2 ). 
Since this is true for any 0\ < 9 2 , we may conclude that the allocation function 
x(-) is non- decreasing. Note that this necessarily implies that x(-) is almost 
everywhere continuous. 

The second consequence of (14) is as follows: By fixing one end point and let- 
ting the other converge towards it, we see that «(•) is absolutely continuous with 
respect to Lebesgue measure and is, thus, almost everywhere differcntiable. 22 
This derivative is 

dv (9) _ du(x(9),6) 
d9 ~ 09 

almost everywhere. 23 Consequently, one can express equilibrium utility as 

v(9) = v(9 L )+ J° ?£(x(t),t)dt. (15) 

Expression (15) and the monotonicity of the allocation function x(-) are, 
thus, necessary properties of a direct-revelation mechanism. In particular, we've 
just proved that a necessary condition for an allocation function x (•) to be 
implcmcntablc is that 

s{6) = v L -u{x{6),6)+ / —(x(t),t)dt. 

where vl is an arbitrary constant. 

It turns out that these properties are also sufficient: 

Theorem 1 ( Characterization of direct-revelation mechanisms ) Within 
the standard framework and under Assumption Al, a direct mechanism (x (•) , s (•)) 
is truthful if and only if there exists a real number vl such that: 

f e du 

a(6) = v L -u{x{9),9)+ ( X (t),t)dt (16) 

Je L o9 

and x(-) is non-decreasing. (17) 

Consequently, an allocation function x(-) is implementable if and only if it is 
non- decreasing. 

22 To be precise, one should make assumptions to ensure that du(x(-), )/d0 is intcgrable on 
©. This can be done by requiring that X be bounded or that du(-, )/d0 be bounded on A* X0. 
Both assumptions are natural in most economic settings and simply extend the assumptions 
that bound x F '(•). Henceforth, we assume du/80 is bounded. 

23 Note the important difference between dn l x W< ^ an( j du[x ($),$] ^ rpj^ f ormer j s tne p ar ti a i 
derivative of u with respect to its second argument evaluated at (x (8) , 9), while the latter is 
the total derivative of u. 
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Proof: Since we established necessity in the text, we need only prove sufficiency 
here. Let (x(-),s(-)) satisfy (16) and (17). Consider the agent's utility when 
the state of nature is 9, but he claims that it is 9' > 9: 

.(*') 



( e du 

s{6') + u{x{6'),6) =v L -u{x {9') ,9')+ / —(x(t),t)dt +u{x{9'),9) 

J e,, 09 



-v L -u(x (9), 9) 



du 
86 



(x(t),t)dt+u(x(9),9) 



(18) 



s(8) 



+ [u{x{9'),9)-u{x (9'), 9')] + 



(x(t),t)dt- / —(x(t),t)dt 



Ol 



89 



Ol 



89 



(19) 



= v(0) + 



du . . . . du , .... . 

-Wt),t)--(x(«'),t) 



dt 



Where the second equality (beginning of (18)) derives from adding and sub- 
tracting s(9). In the last line, the first term, v (9), is (18) and the second term, 
the integral, is (19). Since we've assumed (17), x(t) < x{9') for t € [6, 9']. More- 
over, Al implies 8u/89 is increasing in x. Hence, the integral in the last line is 
non-positive; which means we may conclude 

s(9') + u(x(9'),9) <v(9). 

That is, under this mechanism, the agent does better to tell the truth than 
exaggerate his type. An analogous analysis can be used for 9' < 9 (i.e., to show 
the agent does better to tell the truth than understate his type). Therefore, the 
revelation constraints hold and the mechanism is indeed truthful. SB 



This characterization theorem is, now, a well-known result and can be found, 
implicitly at least, in almost every mechanism design paper. Given its impor- 
tance, it is worth understanding how our assumptions drive this result. In par- 
ticular, we wish to call attention to the fact that neither the necessity of (15) 
nor (16) depends on the Spence-Mirrlees assumption. The Spcnce-Mirrlees as- 
sumption's role is to establish that a monotonic allocation function is necessary 
and that, if x (•) is monotonic, then (16) is sufficient to ensure a truth-telling 
equilibrium. 

To further illustrate these points let's consider an alternative approach to the 
revelation constraints inspired by Guesnerie and Laffont (1984). The standard 
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framework continues to apply. Assume, however, that x(-) is piecewise twice con- 
tinuously differentiable. 24 The revelation constraint, expressed as (14), shows 
this property extends to the transfer function, s(-). Now, let U(8', 0) denote the 
agent's utility when he claims to be type 0' but is really type 0. The revelation 
constraint can, thus, be written: 

v(0)=mzxU(0',0) = U(0,0). (20) 

Applying the envelope theorem to this program yields 

f e {0) = U,{0',0) \ e , =e =^(x(0),0) 

(for all but a finite number of #). 25 The integral expressions (15) and (16) fol- 
low; hence, as claimed, the relationship between the transfer function and the 
allocation function in a direct-revelation mechanism does not depend upon the 
Spence Mirrlees assumption. In fact, these integral expressions are simply de- 
duced from (20) 's first-order condition, Ui(0,0) = 0. Of course, we also need to 
pay attention to the the second-order conditions. Given our differentiability as- 
sumptions, the second-order conditions reduce to U(0' , 0) being locally concave 
in 0' around the point 0' = 0, that is: 

U n (0,0)<O 

(for all but a finite number of 0). Observe that, differentiating the first-order 
condition with respect to 0' , 

U 11 (0,0) + U 12 (0,0)=O; 

hence the local concavity condition is, therefore, equivalent to: 

O<U 12 (0,0)= dxde •»(*) 

(for all but a finite number of 6*). The specific role played by the Spence-Mirrlees 
assumption now emerges: The assumption allows one to translate the second- 
order condition implicit in the revelation program (20) into a monotonicity 
condition on the function x(-). 

This discussion also demonstrates a point that was implicit in our earlier 
discussion of the Spence-Mirrlees assumption: What is critical is not that 

24 "Piecewise" means that the property is true except at a finite number of points. The 
approach in Gucsncric and Laffont (1984) actually requires just that x (•) be piecewise contin- 
uously differentiable. Note that, since x (•) is ultimately endogenous, making any assumptions 
about it is less than ideal. If, however, we show that the optimal (second-best) x (•) has these 
properties, then there's no harm done. 

25 When working with U (•,•) it is helpful to use the notation C7< to denote the partial 
derivative with respect to the ith argument. Uu denotes the second partial derivative with 
respect to the ith argument and U%j denotes the cross partial derivative. 
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be positive, but rather that it keep a constant sign over the relevant domain. 
If, instead of being positive, this cross-partial derivative were negative every- 
where, then our analysis would remain valid, except that it would give us the 
inverse monotonicity condition: x(-) would need to be non-increasing in type. 
But with a simple change of the definition of type, 9 = —9, we're back to our 
original framework. Since, as we argued above, the definition of type is some- 
what arbitrary, we see that our conclusion of a non-decreasing x (•) is simply 
a consequence of the assumption that different types of agent have different 
marginal rates of substitution between money and allocation and that an order- 
ing of these marginal rates of substitution by type is invariant to which point 
in X x S we're considering. 

What if the Spence-Mirrlees assumption is violated (e.g., -j^g changes sign)? 
As our discussion indicates, although we still have necessary conditions concern- 
ing incentive-compatible mechanisms, we no longer have any reason to expect 
x (•) to be monotonic. Moreover — and more critically if we hope to characterize 
the set of incentive-compatible mechanisms — we have no sufficiency results. It 
is not surprising, therefore, that little progress has been made on the problem 
of designing optimal contracts when the Spence-Mirrlees condition fails. 

5.3 Optimization in the standard framework 

The previous analysis has given us, within the standard framework at least, a 
complete characterization of the space of possible (incentive-compatible) con- 
tracts. We can now concentrate on the principal's problem of designing an 
optimal contract. 

Finding the optimal direct-revelation mechanism for the principal means 
maximizing the principal's expected utility over the set of mechanisms that 
induce truthful revelation of the agent's type and full participation. From page 
20, the participation constraint is (12); while, from the previous section, truthful 
revelation is equivalent to (16) and (17). 26 We can, thus, express the principal's 
problem as 



Once again, it's more convenient to work with v(-) than s(-). Observe that 



Moreover, (16) can be used to compute v(9) using only x(-) and a number vl- 




subject to (12), (16), and (17). 



w [x (9) ,9]-s{9) = w[x (9) ,6]-v(6)+u[x (9) , 9} 
= fl(x(9),9)-v(9). 



26 Assuming Al holds, which wc will do henceforth. 
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Hence, we're free to write the principal's problem as 



[w(x(0),O)-s(0)]f(0)d0 



<>L 



n{x{6),0) 

Integration by parts (or Fubini's theorem) 27 implies 



[Q(x(0),0)-v(0)]f(9)d6 

du 



de 



(x(t),t)dt 



f(O)d0 - v L 



du 
89 



(x(t),t)dt 



f(O)M = -J° B [l-F(e)]^(x(O),O)d0, 



which allows us to further transform the principal's objective function: 



p6 H 

Je r , 



[w(x(9),9)-s(9)]f(9)d9- 



n(x(9),e)- 1 - 1 ^l^(x(e) 7 e) 



f (9) d9- 



Remark 1 From this last expression, we can see that it is unreasonable to 
expect to achieve the first best: The principal's objective function differs from 
the first-best objective function, maxE^ {Q (x (9) , 9)}, by 



L 



Consequently, since the principal wishes to maximize something other than ex- 
pected social surplus and the principal proposes the contract, we can't expect the 
contract to maximize social surplus. 



Define 



£(a;,0) = Cl(x,9) - [1 )] ^(x, 9). 



f{9) 89' 

Observe that our earlier assumptions ensure that S(x, 9) is bounded and at least 
twice-differentiable. We will refer to S (x, 9) as the virtual surplus (this follows 
Jullicn, 1996). 28 

The principal's problem can now be restated in a tractable and compact 
form: 



max 



subject to «i 



X(x(9),9)f(0)d9-v L 
9 8u, 



(21) 



88 



(x(t),t)dt > Ur 



and x(-) is non-decreasing. 



27 Both arc equivalent in the present case, but Fubini's theorem is perhaps more appropriate 
since it extends to multi-dimensional frameworks as well. Note too our reliance on du/dO being 
bounded (see footnote 22). 

28 Guesnerie and Laffont (1984) and Caillaud et al. (1988) use the term surrogate welfare 
Junction. 
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Before solving this program generally (a problem we return to in Section 
5.5), we will first solve the problem under three additional assumptions: 

1. For all x, du/d9 > {i.e., utility is non-decreasing in type); 

2. £ (-, 9) is strictly quasi-concave for all 9 e [9 L , 9 H \; and 

3. dT,/dx is non-decreasing in 9 for all x. 

What are the consequences of these assumptions? The first entails that the 
participation constraint holds for all types if it holds for the lowest type, 9^. 
Consequently, we can ignore this constraint for all but the lowest type. More- 
over, for this type, the constraint reduces to vl > Ur. Since vl is a direct 
transfer to the agent without incentive effects, we know the principal will set it 
as low as possible. That is, we can conclude that, optimally, vl = Ur. Note, too, 
this means the participation constraint is binding for the lowest type (similarly 
to what we saw in the retailer-supplier example). To summarize: 

Lemma 1 If utility is non- decreasing in type for all allocations (i.e., du/d9 > 
for all x), then (i) v L = Ur; (ii) the participation constraint is binding for the 
lowest type, 9 L ; and (Hi) the participation constraint holds trivially (is slack) 
for all higher types (i.e., for 9 > 9l). 

In light of this lemma, we can be emboldened to try the following so- 
lution technique for (21): Ignore the monotonicity constraint and see if the 
unconstrained problem yields a monotonic solution. The solution to the uncon- 
strained problem, x* (•), is to solve (21) pointwise; that is, to set x* (9) = X (9), 
where 

X{6) = argmaxS(x,6») . 

X 

Note that the second assumption means X (•) is uniquely defined. Finally, the 
third assumption — the marginal-benefit schedule, dYi/dx, is non-decreasing in 
9 — means the point at which (x, 9) jdx crosses zero is non-decreasing in 9. 
But this point is X (9); hence, monotonicity is ensured. To conclude: 

Proposition 4 If 

• for all x, du/d9 > 0; 

• £(-,#) is strictly quasi- concave; and 

• dYi/dx is non- decreasing in 9; 

then the solution to (21) is x* (9) = X (9) and Vl = Ur. 

How does the solution in Proposition 4 compare to the full-information 
benchmark? The answer is given by the following corollaries: 



Corollary 1 x* (9) < x F (9) for all 9 E [9 L ,9 H ) and x* (9 H ) = x F (9 H )- 
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Proof: At x = x F (9), 



8Z l-F (0) d 2 u 

x 



dx f(9) dxde' 

Since (i) 1 — F (6) > (except for = Oh) and (ii) the cross-partial deriva- 
tive is strictly positive by Al, the right-hand side is negative for all 9, except 
9 H . Consequently, since £(-,0) is strictly quasi-concave, we can conclude that 
x* (9) < x F (0) for all 9 e [9 L ,6 H ). For = H , we've just seen that 

d^[x F {9 H ),9 H ] =q 
dx 

hence, the strict quasi-concavity of £ (-,0h) ensures that x F (Oh) is the maxi- 
mum. H 



Corollary 2 v' (9) > and v (9 L ) = U R . 

Proof: We've already established the second conclusion. The first follows since 
v(9) = v L + ^ —( x (t),t)dt; 

so 

v'(e) = ^[x(6),6]>0 
by the first assumption. ■ 

In much of the literature, the three additional assumptions supporting Propo- 
sition 4 hold, so Proposition 4 and its corollaries might be deemed the "standard 
solution" to the contractual screening problem within the standard framework. 
These conclusions are sometimes summarized as 

Remark 2 Under the standard solution, there's a downward distortion in allo- 
cation (relative to full information) for all types but the highest, the lowest type 
earns no information rent, but higher types may. 

Observe the "may" at the end of the last remark becomes a "do" if 

du[X(0),0] 

do 

for all 0>0 L . 29 

29 There are, of course, many assumptions that will change the "may" to a "do." 
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5.4 The retailer-supplier example revisited 



Before returning to our fairly abstract analysis of contractual screening, let's 
consider an extension of our earlier retailer-supplier example. Specifically let's 
imagine that there are a continuum of efficiency types, which we normalize to 
be the interval [1,2]. Instead of C t (x), write the supplier's cost function as 
C (x, 9), and suppose that 

ls<* < 22 > 

that is, higher (more efficient) types have lower marginal costs. Since 

u(x,6) = -C(x,6) , 

(22) implies that the Spence-Mirrlees assumption is met. In addition, to these 
assumptions, we are maintaining all the assumptions from our earlier model. 
In particular, we assume the revenue function, r(-), is concave and bounded. 
Hence, since C(-,9) is strictly convex, 

lim U (x, 9) = — oo for all 9. 

x—>oo 

Assume, too, that dQ (0, 9) jdx > for all 9; i.e., x F (9) > for all 9. It is read- 
ily checked that all the assumptions of the standard framework are, therefore, 
satisfied. 

Since C(-,8) is a cost function, we necessarily have C(0,9) = for all 9. 
Combined with (22), this entails that C(x,0) > C(x,6') if 9 < 9'. 30 Hence, we 
may conclude 

9u (x,9)>0. 



89 



(23) 



Observe that 



Y,(x,6) = r(x)-C(x,6) 



Hence, 



as (x, 9) 



r' (x) 



dC_ 
dx 



^-F{9) 
f(0) 

l-F{9) 



-dC 



-d 2 c 



dxd9 



dx w dx f{6) 

It is clear, therefore, that to take advantage of Proposition 4, we need to know 
something about the shape of dC(-,9) jd9 (e.g., is it at least quasi-concave) 
and how the Mills ratio 31 and the cross-partial derivative change with respect 
to 9. To this end, let's impose two frequently made assumptions: 



30 Proof: The initial condition, C (0, i) = for all t 6 [1, 2], means we can write 



fj 
JO J 6 



8' 



d 2 C 



e dBdx 



dOdx; 



C(x,6') -C(x,9) 

the result follows from (22). 

31 The Mills ratio is the ratio of a survival function (here, 1 — F (8)) to its density function 
(here, f (0)). Because the Mills ratio is the inverse of the hazard rate, it is also known as the 
inverse hazard rate (this formal distinction between the inverse and "regular" hazard rate is 
not always respected by economic theorists). 



Mills r atio 
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• C (x,6) — h (9) c (a;), where h (•) is positive, strictly decreasing, and convex 
— higher types have lower marginal costs, but this marginal cost advantage 
may be less pronounced when moving up from one high type to another 
than it is when moving up from one low type to another. The function 
c(-) is strictly increasing, strictly convex, and c(0) =0. 

• Let M (6) denote the Mills ratio. Assume M' (6) < 1. 

Given these assumptions, we may conclude that £ (•, 9) is globally strictly con- 
cave and that 

d 2 ^ g) = -ti (9) c' (x) + M' (9) ti (9) c' (x) + M (9) h" (9) c' (x) 
cx ti (9) [M' (9) - 1] + M (9) h" (9) > 0. 

That is, we may conclude that E (•, 9) admits a unique maximum, which is non- 
decreasing with type. Combined with (23), this means we can apply Proposition 
4. 

For example, suppose r (x) — x, h (9) = 1/9, c (x) = x 2 /2, and F (9) = 9 — 1 
(i.e., the uniform distribution on [1,2]). Both bullet points are met, so we can 
apply Proposition 4. This yields x* (9) = X (9), where 

X{9) solves 1- X - -(2-0) J = 0; 

orX(9) = \9\ 

From Proposition 4, we may set vl = Ur, which is zero in this model. Noting 
that du/89 = x 2 /29 2 , we thus have 



v(0) = 0+ / ^dt 
Je L 



9 t 2 



2t 2 

dt 

l » 
24 24 



Hence, the transfer function is 



s 



(9) = v(9)-u(x(9),9) 



3 



Observe that 



1 n3 1 6 

— 9 s h 

24 24 
#3 _ 1_ 
¥ ^ 24' 



x* (9) = l -9 2 < 9 = x F (9) 
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for all 9 < 2; that x* (2) = 2 = x F (2); that v (1) = 0; and that v (9) > for all 
9 > 1 — all consistent with the corollaries to Proposition 4. 

Finally, although there's no need to do it, we can check that the incentive- 
compability constraints are indeed satisfied by the mechanism {\9 2 , ^9 3 — 



max U 



H = 



max -9 A 

e 6 24 

9 2 9 3 
Y-29=°- 



¥ 2 



29 



Clearly, 9=9 satisfies the first-order condition (since U (-,0) is clearly concave, 
the second-order conditions arc also met). 

By the taxation principle (Proposition 3), an alternative to this direct- 
revelation contract is a payment schedule. Noting that a;*" 1 (x) = V2x, an 
optimal payment schedule, S(x), is 



S(x) 



s[x*-i(x)\=^-i i forxe[l,2] 
for x i [\,2] 



5.5 General conditions for solving the principal's problem 

We have seen that the principal's problem (21) has a simple and straightforward 
solution if du/d9 is non-negative, £(•,#) is strictly quasi-concave, and dT,/dx 
is non-decreasing in 9 (Proposition 4). In this section, we explore solving (21) 
under more general assumptions. 

We begin, first, with the question of whether a solution to (21) exists. 32 
Our assumptions on w(-,9) and u(-,9) ensure that Q(-,9) is concave and has 
an interior maximum. The latter conclusion means x F (-) is bounded. This is 
relevant since, as Jullicn (1996) shows, if > 0, then a bounded x F (•) implies 
that we can ignore any allocation function such that x(9h) > max^ge { xF (8)}- 
Hence, by the monotonicity condition, we can then conclude that the optimal 
x (•) is bounded. A bounded x (•) helps to ensure the contract space is compact, 
which is sufficient for existence of an optimal contract. 

Theorem 2 (Jullien (1996)) Assume the standard framework, Al, and 

— (x,9)>0 V(x,9)eXxe, (24) 
then there exists an optimal contract and the optimal allocation is bounded. 



32 Some articles that deal with the existence question arc Gucsncric and Laffont (1984); 
Page (1992); Jullien (1996) and Rochet and Chone (1998). Each considers slightly different 
assumptions and offer varying degrees of generality (including relaxing some of the standard 
framework assumptions). Here, we follow Jullien's approach. 
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Proof: See Jullien (1996). ■ 

As we saw in the previous section, (24) is a natural assumption in many 
contexts indeed, as in the last example, it can be a consequence of the Spence- 
Mirrlees assumption. It is not, however, always so straightforward. For instance, 
suppose that, as in Lewis and Sappington (1989), we had 



u{x,6) = -C(x,0) = 



(6-k)x-K(6) if x > 
if x = 



that is, there is a fixed (overhead) cost associated with production that is also 
a function of type. 33 An interpretation of this cost function is that there is 
a technology frontier that involves tradeoffs between marginal and fixed costs, 
with the consequence that a type that has low marginal costs has high fixed costs 
and vice versa (i.e., K' (6) > 0). Hence, while the Spence-Mirrlees assumption 
is clearly met, (24) could fail to hold. 34 

Existence is one thing. Characterizing the solution and determining the 
uniqueness of the solution are another. If 

£(-,#) is strictly quasi-concave, (25) 

then the solution must be unique. Moreover, making this assumption facilitates 
characterizing the solution, as we saw in Proposition 4. Indeed, we know of no 
research in which this assumption is not made. Like lemmings, we will follow 
the herd in this regard. Nonetheless, it is worth taking a moment to consider 
what this assumption entails. There are many economic justifications we can 
give to ensure strict quasi-concavity of the true surplus function, ft (-,9). But 
the virtual surplus function, £(•,#), differs from the true surplus function by 
an amount 

l-F{9) du(-,9) 

JW) oe~' 

As a general proposition, it is difficult to argue from economic principles that 
—du(-,9) jd9 should be strictly quasi-concave. 35 In specific cases, admittedly, 
one can appeal to economic principles: If, as we did in our extended retailer- 
supplier example, one is willing to postulate u exhibits separability — 

u{x,6) = h (x)h(0) + b 2 (x) + K(9), 



33 Note this cost function violates our earlier maintained assumption of continuity at x = 0. 

34 Note that we could move the K (0) term into the reservation utility; i.e., set Ur (8) = 
K (9). Indeed, there is often an isomorphism between models in which (24) fails, but Al holds, 
and models with type- dependent reservation utilities. Put loosely, given that we're "truly" in 
the standard framework, we can generally expect (24) to hold if Al holds. 

35 Of course, it is not necessary that both components of £(-,#) be strictly quasi-concave 
for E (-,9) to be strictly quasi-concave. However, it is sufficient and certainly an easy way to 
verify E (•, 8) is strictly quasi-concave. 
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with h! (9) < — then strict quasi-concavity of — du (•, 9) jd9 is ensured by the 
same economic principles that justify u being strictly quasi-concave. But be- 
cause separability is not a generic property, nor a natural economic property, it 
is worth appreciating that (25) need not be an innocuous assumption. 

Given (24) and (25), we have two of the three assumptions (in addition to 
the standard framework and Al) required by Proposition 4. The third, that the 
marginal virtual surplus be non-decreasing in type — i.e., that 

dT, 

— be non-decreasing in 9 — (26) 

can be a fairly stringent condition. Writing out marginal virtual surplus, 

<9£ _ dw du 1-F (9) d 2 u 
dx dx dx f (9) dxd9 ' 

we see that only one of our maintained assumptions — Spence-Mirrlees — applies 
and it helps with only one term (the middle) . Of course in many models, such 
as our extended retailer-supplier model, the principal's utility, w, is not directly 
a function of type, so dw/dx is trivially non-decreasing in type. This leaves the 
last term. Using M (•) to denote the Mills ratio, a sufficient condition for the 
third term to be non-decreasing in type is 

MUm^ + M (<)) *^<„. (27, 

The Mills ratio must be positive and, by Spence-Mirrlees, the cross-partial 
derivative must also be positive. Hence, sufficient conditions for (27) to be 
valid are that M' (9) and the third derivative be non-positive. It is difficult to 
tell a compelling economic story for why the third derivative should be non- 
positive; hence, this is a problematic assumption. Whether M' (9) < 0, known 
as the monotone hazard rate property, depends on the underlying distribution 
assumed for the types. 36 Of course, (26) is a only a sufficient condition. If 
in maximizing S (x,9) we discovered that the function X (•) is non-decreasing, 
the result in Proposition 4 would still hold true. The problem is, except when 
working with specific functional forms, it is rather difficult to directly assess the 
monotonicity of X (•). 

What if X (•) is not monotonic? Then, although the principal would like to 
impose a contract with x (0) = X (9), we know that such an allocation function 
won't be incentive compatible. In this case, intuition suggests that the principal 
will try to design a contractual allocation function x(-) as close as possible to 
X(-), but subject to the condition that x(-) be non-decreasing. Put differently, 
the constraint that x (•) be non-decreasing must now bind over some interval(s) 
of types and, hence, we must pay explicit attention to it. But if this condition 



36 Some distributions that satisfy the monotone hazard rate property arc the uniform, the 
exponential, the logistic, and the normal. An example of a distribution not satisfying this 
property is the Pareto distribution. 
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binds, then we must have x' [9) = for some interval; that is, we lose the full 
separation across types we had before (e.g., in our retailer-supplier models). 
Such non-separation is called bunching. bunching 

Deriving the optimal contract when X (•) is non-monotonic is a standard 
optimal-control problem. Note that before we can tackle this control problem, 
we need to be assured that we can look for the optimal x (•) within the class of 
absolutely continuous functions. This level of technicality is, however, beyond 
our scope here and the interested reader is directed to Jullien (1996). We will 
simply assume the optimal x (•) is absolutely continuous. Hence, the principal's 
program can be written as: 



ax [ * Z(x{9),6)f{9)d6 
>!/(■) Je, 



max 

*(.) 

u T 

s.t. — (0) = y(0), for all 9 
and y(8) > 0, for a.c. 9. 

In the language of optimal-control problems, x(-) is the state variable, y(-) is 
the control variable, and the program imposes a positivity constraint on the 
control. Introducing the co-state variable A(-), we obtain the Hamiltonian for 
this problem: 

H [x(-),y(-),\(-),6] = E [x(9),6] f(6) + A(%(0). 

The necessary first-order conditions are then: A(-) is absolutely continuous and 
non-positive, A(0) = A(l) = 0, 

y{6) eargmax[£(a;(0),0)/(6>) + A(%], and (28) 

y>0 

Condition (28) is particularly trivial since it points towards simple results: if 
X(9) < over some interval, then y(9) must be equal to which means that the 
allocation function x(-) must be constant — there is bunching over this interval. 
And if \(9) = over some interval, then obviously ^ — and x(9) = X(9) over 
this interval. It follows that the optimal allocation function £*(•) is obtained 
by piecing together the increasing parts of X(-) and constants, so that x*(-) is 
continuous. We may conclude that: 

Proposition 5 ( Characterization of the optimal contract with bunch- 
ing) Within the standard framework and assuming Al, (24), and (25), the op- 
timal contract necessarily satisfies the following: The allocation function x*(-) 
is continuous, bounded, and for almost all 9 either 



• x*(9)=X{8); or 
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• x*(6) is constant, equal to Xi, over some interval (&i,0i) such that for all 

I -—(xi,t)dt<Oand / — — (x^, t)dt = 0. 
Je> dx Je t 9x 

Remark 3 This proposition only provides necessary conditions, as the mono- 
tonicity condition may introduce non-convexities in the optimization problem. 
If, however, we assume thatT,(-,8) is strictly concave, then the above conditions 
are necessary and sufficient. 

Figure 3 illustrates. 

5.6 Random-allocation mechanisms 

Remember that we have heretofore ruled out random-allocation mechanisms 
by fiat; that is, without considering whether random-allocation mechanisms 
could be superior to deterministic mechanisms. Here, we briefly reconsider 
when this is appropriate. Recall from our earlier discussion that (i) additively 
separability plus risk-neutrality over money mean there is no point to consider 
random-payment mechanisms and (ii) absent incentive effects, there is no point 
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to random- allocation mechanisms. We're not, however, in a world without incen- 
tive effects; hence, we might ask whether a random-allocation mechanism eases 
the truth-telling constraints for the principal enough to compensate for the risk 
imposed (recall both principal and agent are risk-averse — at least weakly — with 
respect to allocations). 

If we suppose that du/d9 is convex (at least weakly) in x — an assumption 
we've essentially made before to ensure the strict quasi-concavity of the vir- 
tual surplus function — then the answer is no. Given this assumption plus the 
assumed strict concavity of Cl (-, 6), we see that 

E(.,0)-OM)--^-- 

is a concave function. Let x (•) be any monotonic deterministic-allocation func- 
tion and let x(-) be any random-allocation function with the property that 
E{x (8)} = x (9). Then Jensen's inequality implies 

E m {0),9]}<E[x (9), 9} 

for all 9. That is, taking into account the incentive constraint, the principal's 
expected utility is less, type by type. Hence, randomizing based on a feasi- 
ble deterministic-allocation mechanism cannot improve the principal's expected 
utility. See Maskin (1981) for an analysis when du/d9 is not convex. 



6 The Hidden-Knowledge Model 

In this section, we consider a model that, at first, seems quite different than con- 
tractual screening, but which ultimately shares many similarities to it. In par- 
ticular, we consider the hidden-knowledge model. In this model, unlike above, hidden-knowledge 
the principal and agent are symmetrically informed at the time they enter into 
a contractual arrangement. After contracting, the agent acquires private infor- 
mation (his hidden knowledge). As an example, suppose the principal employs 
the agent to do some task — for instance, build a well on the principal's farm — 
initially, both parties could be symmetrically informed about the difficulty of 
the task (e.g., the likely composition of the rock and soil, how deep the water 
is, etc.). However, once the agent starts, he may acquire information about how 
hard the task really is (e.g., he alone gains information that better predicts the 
depth of the water). 

This well-digging example reflects a general problem. In many employment 
situations, the technological, organizational, market, and other conditions that 
an employee will face will become known to him only after he's been employed 
by the firm. This information will affect how difficult his job is, and, thus, his 
utility. Similarly, think of two firms that want to engage in a specific trade 
(e.g., a parts manufacturer and an automobile manufacturer who contract for 
the former to supply parts meeting the latter's unique specifications). Before the 
contract is signed, the supplier may not know much about the cost of producing 



Caillaud and Hcrmalin 



The Hidden-Knowledge Model 



40 



the specific asset and the buyer may have little knowledge about the prospects 
of selling the good to downstream consumers. These pieces of information will 
flow in during the relationship — but after contracting — and once again a hidden- 
knowledge framework is more appropriate for studying such a situation. 

This difference in timing is reflected in the participation constraint for this 
problem. When considering a contract, (x (•) , s (•)) , the agent compares his 
expected utility if he accepts the contract, 

E{u(6)} = E{s(6) + u[x(6),6]}, 

to his expected utility if he refuses the contract; that is, to Uf, = E{Ur(6)}. 
Acceptation or refusal of the contract cannot depend upon the, as yet, unrealized 
state of nature: Unlike the screening model, the participation decision is not 
contingent on type. 

Why is this discussion so important? Because it turns out that, at least in the 
standard framework, the hidden-knowledge model has an extremely simple solu- 
tion. To see this, we invoke the assumptions of the standard framework, includ- 
ing the Spence-Mirrlees assumption, Al. In addition, we assume — consistent 
with the assumptions of the standard framework — that dfl/dx is non-decreasing 
in 9. Consequently, we know the first-best allocation, x F (-), is non-decreasing. 
We can then be sure from Theorem 1 that there exists a transfer function, s F (•), 
such that (x F (•) , s F (•)) is a direct-revelation mechanism. 37 Note that because 
s F (•) is defined by (16), it is defined up to a constant that can be chosen by the 
principal to ensure the agent meets his participation constraint. In particular, 
it can be chosen so that the agent's expected utility equals his non-participation 
expected utility: 

r&H 

[§ F (9)+u(x F (9),9)]f(9)d9 = / U R {9)f{9)d9. 
With this mechanism, the principal's expected utility becomes: 

I" [w{x F {9),9) - s F (9)] f(9)d9 = f H [n(x F (9),9) - U R {9)} f(9)d9 

= f H n(x F (9),9)f(9)d9-U^. 

J6t. 



Since the ex post efficient allocation — that is, x F (•) — maximizes the integrand 
in the last integral, the principal obtains the highest possible expected utility 
with this mechanism. Hence, (x F (•) ,s F (•)) is optimal and we see, therefore, 
that the first-best allocation will be achieved with hidden-knowledge, in contrast 
to the less desirable equilibrium of the screening model: 



37 The transfer function s F (•) is a priori different from the full-information transfer func- 
tion, s F (•), because the latter is not subject to revelation constraints. 
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Proposition 6 In a hidden-knowledge model, in which the agent fully commits 
to the contract before learning his type, which satisfies the assumptions of the 
standard framework, and in which dQ/dx is non- decreasing in 9, then the equi- 
librium allocation is the ex post efficient allocation. 

To gain intuition for this result, return to our two- type example from Section 
3, but suppose now that the agent doesn't learn his type until after contracting 
with the principal. Observe that we can reduce the agent's information rent on 
average by paying him less than his full-information payment if he announces 
he is the low type {i.e., type /) and more than his full-information payment if 
he announces he is the high type {i.e., type E): Now set the payments to be 

sf — Cj (xf) — 7; and 
sf = C E (xf) + n. 

Since the agent doesn't learn his type until after contracting, the participation 
constraint is 

/ x (sf - Cj (xf)) + (1 - /) x (sf C E (xf)) = -/ 7 + (1 - /) n 

> 0. 

We also need direct revelation, which, for type /, means 

sf-C I (xf) > sf - Ci (xf) ; or 

-7 > V+[C E (xf)-Cj(xf)]. 

Treating these two constraints as equalities, we can solve for 7 and n: 

T] = f x [Cj (xf) - C E (xf)] ; and 
7 = (1 - f) x [Cj (xf) - C E (xf)] . 

Provided these also satisfy type E's revelation constraint, we're done. But they 
do, since 

sf - Ci (xf) = 7] 

= -7 + Cj (xf) - C E (xf) 
> - 1 + Cj(xf)-C E (xf) 
= §f-C E (xf) 

(the inequality follows because, recall, Cj (•) — Ce (•) is increasing). 

Note the phrase "in which the agent fully commits to the contract" that 
constitutes one of the assumptions in Proposition 6. Why this assumption? 
Well suppose that, after learning his type, the agent could quit (a reasonable 
assumption if the agent is a person who enjoys legal protections against slavery). 
If his payoff would be less than Ur (9) if he played out the contract, he would 
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do better to quit. 38 To keep the agent from quitting, the principal would have 
to design the contract so that U (9, 9) > Ur (9) for all 9. But then this is just 
the screening model again! In other words, the hidden-knowledge model reverts 
to the screening model — with all the usual conclusions of that model — if the 
agent is free to quit (in the parlance of the literature, if interim participation 

constraints must be met). Even if anti-slavery protections don't apply (e.g., the interim participation 

agent is a firm), interim participation could still matter; for instance, in the last 

example, if 7 is too big, then the agent may not have the financial resources 

to pay it (it would bankrupt him). Alternatively, in nations with an English 

law tradition, 7 could be perceived as a penalty, and in many instances the 

courts will refuse to enforce contracts that call for one party to pay a penalty to 

another. In short, because interim participation constraints are often a feature 

of the real world, many situations that might seem to fit the hidden-knowledge 

model will ultimately prove to be screening-model problems instead. 

7 Concluding Remarks 

The screening model and variants, such as the hidden-knowledge model, are 
widely used models in economics. These models capture a fundamental tension 
in many contractual settings: One party has superior information about a state 
of nature relevant to both. Like any advantage, the party with the superior 
information will seek to capture some rents from this. In response, the other 
party, particularly if she has bargaining power to preserve, will seek to design 
a contract that limits the rents of the better-informed party. As we saw, in 
general, this will lead to distortions in physical allocations and, hence, create 
deadweight loss. 
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